Conflict acts as an implicit cost in reinforcement learning.
نویسندگان
چکیده
Conflict has been proposed to act as a cost in action selection, implying a general function of medio-frontal cortex in the adaptation to aversive events. Here we investigate if response conflict acts as a cost during reinforcement learning by modulating experienced reward values in cortical and striatal systems. Electroencephalography recordings show that conflict diminishes the relationship between reward-related frontal theta power and cue preference yet it enhances the relationship between punishment and cue avoidance. Individual differences in the cost of conflict on reward versus punishment sensitivity are also related to a genetic polymorphism associated with striatal D1 versus D2 pathway balance (DARPP-32). We manipulate these patterns with the D2 agent cabergoline, which induces a strong bias to amplify the aversive value of punishment outcomes following conflict. Collectively, these findings demonstrate that interactive cortico-striatal systems implicitly modulate experienced reward and punishment values as a function of conflict.
منابع مشابه
Adaptive Dynamic Programming Applied to a 6 DoF Quadrotor
This chapter discusses how the principles of Adaptive Dynamic Programming (ADP) can be applied to the control of a quadrotor helicopter platform flying in an uncontrolled environment and subjected to various disturbances and model uncertainties. ADP is based on reinforcement learning. The controller (actor) changes its control policy (action) based on stimuli received in response to its actions...
متن کاملBalancing Energy Expenditure in WSNs through Reinforcement Learning: A Study
This work describes a study of applying reinforcement learning to balance energy expenditure in wireless sensor networks, contributing both with a protocol and a novel observation regarding exploration in reinforcement learning. Our starting point is FROMS, our own multi-source multi-sink routing protocol which exploits learning to identify the lowest cost paths for data delivery. The primary m...
متن کاملA Multiagent Reinforcement Learning algorithm to solve the Community Detection Problem
Community detection is a challenging optimization problem that consists of searching for communities that belong to a network under the assumption that the nodes of the same community share properties that enable the detection of new characteristics or functional relationships in the network. Although there are many algorithms developed for community detection, most of them are unsuitable when ...
متن کاملThe Effect of Explicit and Implicit Instruction through Plays on EFL Learners’ Speech Act Production
Despite the general findings that address the positive contribution of teaching pragmatic features to interlanguage pragmatic development, the question as to the most effective method is far from being resolved. Moreover, the potential of literature as a means of introducing learners into the social practices and norms of the target culture, which underlie the pragmatic competence, has not been...
متن کاملImplicit Temporal Differences
In reinforcement learning, the TD(λ) algorithm is a fundamental policy evaluation method with an efficient online implementation that is suitable for large-scale problems. One practical drawback of TD(λ) is its sensitivity to the choice of the step-size. It is an empirically well-known fact that a large step-size leads to fast convergence, at the cost of higher variance and risk of instability....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Nature communications
دوره 5 شماره
صفحات -
تاریخ انتشار 2014